11 research outputs found

    A Dependable Hybrid Machine Learning Model for Network Intrusion Detection

    Full text link
    Network intrusion detection systems (NIDSs) play an important role in computer network security. There are several detection mechanisms where anomaly-based automated detection outperforms others significantly. Amid the sophistication and growing number of attacks, dealing with large amounts of data is a recognized issue in the development of anomaly-based NIDS. However, do current models meet the needs of today's networks in terms of required accuracy and dependability? In this research, we propose a new hybrid model that combines machine learning and deep learning to increase detection rates while securing dependability. Our proposed method ensures efficient pre-processing by combining SMOTE for data balancing and XGBoost for feature selection. We compared our developed method to various machine learning and deep learning algorithms to find a more efficient algorithm to implement in the pipeline. Furthermore, we chose the most effective model for network intrusion based on a set of benchmarked performance analysis criteria. Our method produces excellent results when tested on two datasets, KDDCUP'99 and CIC-MalMem-2022, with an accuracy of 99.99% and 100% for KDDCUP'99 and CIC-MalMem-2022, respectively, and no overfitting or Type-1 and Type-2 issues.Comment: Accepted in the Journal of Information Security and Applications (Scopus, Web of Science (SCIE) Journal, Quartile: Q1, Site Score: 7.6, Impact Factor: 4.96) on 7 December 202

    Pooled analysis of WHO Surgical Safety Checklist use and mortality after emergency laparotomy

    Get PDF
    Background The World Health Organization (WHO) Surgical Safety Checklist has fostered safe practice for 10 years, yet its place in emergency surgery has not been assessed on a global scale. The aim of this study was to evaluate reported checklist use in emergency settings and examine the relationship with perioperative mortality in patients who had emergency laparotomy. Methods In two multinational cohort studies, adults undergoing emergency laparotomy were compared with those having elective gastrointestinal surgery. Relationships between reported checklist use and mortality were determined using multivariable logistic regression and bootstrapped simulation. Results Of 12 296 patients included from 76 countries, 4843 underwent emergency laparotomy. After adjusting for patient and disease factors, checklist use before emergency laparotomy was more common in countries with a high Human Development Index (HDI) (2455 of 2741, 89.6 per cent) compared with that in countries with a middle (753 of 1242, 60.6 per cent; odds ratio (OR) 0.17, 95 per cent c.i. 0.14 to 0.21, P <0001) or low (363 of 860, 422 per cent; OR 008, 007 to 010, P <0.001) HDI. Checklist use was less common in elective surgery than for emergency laparotomy in high-HDI countries (risk difference -94 (95 per cent c.i. -11.9 to -6.9) per cent; P <0001), but the relationship was reversed in low-HDI countries (+121 (+7.0 to +173) per cent; P <0001). In multivariable models, checklist use was associated with a lower 30-day perioperative mortality (OR 0.60, 0.50 to 073; P <0.001). The greatest absolute benefit was seen for emergency surgery in low- and middle-HDI countries. Conclusion Checklist use in emergency laparotomy was associated with a significantly lower perioperative mortality rate. Checklist use in low-HDI countries was half that in high-HDI countries.Peer reviewe

    Global variation in anastomosis and end colostomy formation following left-sided colorectal resection

    Get PDF
    Background End colostomy rates following colorectal resection vary across institutions in high-income settings, being influenced by patient, disease, surgeon and system factors. This study aimed to assess global variation in end colostomy rates after left-sided colorectal resection. Methods This study comprised an analysis of GlobalSurg-1 and -2 international, prospective, observational cohort studies (2014, 2016), including consecutive adult patients undergoing elective or emergency left-sided colorectal resection within discrete 2-week windows. Countries were grouped into high-, middle- and low-income tertiles according to the United Nations Human Development Index (HDI). Factors associated with colostomy formation versus primary anastomosis were explored using a multilevel, multivariable logistic regression model. Results In total, 1635 patients from 242 hospitals in 57 countries undergoing left-sided colorectal resection were included: 113 (6·9 per cent) from low-HDI, 254 (15·5 per cent) from middle-HDI and 1268 (77·6 per cent) from high-HDI countries. There was a higher proportion of patients with perforated disease (57·5, 40·9 and 35·4 per cent; P < 0·001) and subsequent use of end colostomy (52·2, 24·8 and 18·9 per cent; P < 0·001) in low- compared with middle- and high-HDI settings. The association with colostomy use in low-HDI settings persisted (odds ratio (OR) 3·20, 95 per cent c.i. 1·35 to 7·57; P = 0·008) after risk adjustment for malignant disease (OR 2·34, 1·65 to 3·32; P < 0·001), emergency surgery (OR 4·08, 2·73 to 6·10; P < 0·001), time to operation at least 48 h (OR 1·99, 1·28 to 3·09; P = 0·002) and disease perforation (OR 4·00, 2·81 to 5·69; P < 0·001). Conclusion Global differences existed in the proportion of patients receiving end stomas after left-sided colorectal resection based on income, which went beyond case mix alone

    A Hybrid Dependable Deep Feature Extraction and Ensemble-Based Machine Learning Approach for Breast Cancer Detection

    No full text
    Breast cancer is a prevalent and life-threatening disease that requires effective detection and diagnosis methods to improve patient outcomes. Deep learning (DL) and machine learning (ML) techniques have emerged as powerful tools in breast cancer detection, offering benefits such as improved accuracy and efficiency. However, existing methods have scalability and performance limitations, emphasizing the need for further research. In this paper, we propose a hybrid dependable breast cancer detection approach that combines the power of DL using a pre-trained ResNet50V2 model and ensemble-based ML methods. The integration of DL enables the approach to learn and extract hidden patterns from complex breast cancer images, while ML algorithms contribute interpretability and generalization capabilities. We conducted extensive experiments using a breast histopathology image-based publicly available Invasive Ductal Carcinoma (IDC) dataset comprising samples of different sizes. The results obtained from our rigorous experiments provide compelling evidence for our hybrid model&#x2019;s robustness and high performance. We achieved a higher accuracy rate of 95&#x0025;, precision of 94.86&#x0025;, recall of 94.32&#x0025;, and F1 score of 94.57&#x0025; compared to state-of-the-art models. We also identified Light Boosting Classifier (LGB) as the most suitable ML model in conjunction with the ResNet50V2 architecture. The results of this research offer significant contributions to breast cancer detection through an innovative approach, comprehensive performance analysis, and dependable assessment. Moreover, it has the potential to assist medical professionals in making informed decisions, improving patient care, and enhancing outcomes for breast cancer patients

    An ensemble machine learning based bank loan approval predictions system with a smart application

    No full text
    Banks rely heavily on loans as a primary source of revenue; however, distinguishing deserving applicants who will reliably repay loans presents an ongoing challenge. Conventional selection processes often struggle to identify the most suitable candidates from a pool of loan applicants. In response to this challenge, we present an innovative machine learning (ML) based loan prediction system designed to identify qualified loan applicants autonomously. This comprehensive study encompasses data preprocessing, effective data balancing using SMOTE, and the implementation of diverse ML models, including Logistic Regression, Decision Tree, Random Forest, Extra Trees, Support Vector Machine, K-Nearest Neighbors, Gaussian Naive Bayes, AdaBoost, Gradient Boosting, and advanced deep learning models such as deep neural networks, recurrent neural networks, and long short-term memory models. The model's performance is rigorously assessed in terms of accuracy, recall, and F1_score. Our experimental analysis reveals that the Extra Trees outperforms its counterparts. Furthermore, we successfully predict bank loan defaulters through an ensemble voting model, which includes the top three ML models, achieving a remarkable 0.62% increase in accuracy compared to the Extra Trees. To facilitate user interaction, we have developed a user-friendly desktop-based application. Notably, our findings demonstrate that the voting-based ensemble model surpasses both individual ML models, including Extra Trees, and existing state-of-the-art approaches, achieving an impressive accuracy of 87.26%. This innovative system has the potential to significantly streamline and enhance the efficiency of bank loan approval processes, ultimately benefiting both financial institutions and loan applicants alike

    Machine learning based diabetes prediction and development of smart web application

    No full text
    Diabetes is a very common disease affecting individuals worldwide. Diabetes increases the risk of long-term complications including heart disease, and kidney failure among others. People might live longer and lead healthier lives if this disease is detected early. Different supervised machine learning models trained with appropriate datasets can aid in diagnosing the diabetes at the primary stage. The goal of this work is to find effective machine-learning-based classifier models for detecting diabetes in individuals utilizing clinical data. The machine learning algorithms to be trained with several datasets in this article include Decision tree (DT), Naive Bayes (NB), k-nearest neighbor (KNN), Random Forest (RF), Gradient Boosting (GB), Logistic Regression (LR) and Support Vector Machine (SVM). We have applied efficient pre-processing techniques including label-encoding and normalization that improve the accuracy of the models. Further, using various feature selection approaches, we have identified and prioritized a number of risk factors. Extensive experiments have been conducted to analyze the performance of the model using two different datasets. Our model is compared with some recent study and the results show that the proposed model can provide better accuracy of 2.71% to 13.13% depending on the dataset and the adopted ML algorithm. Finally, a machine learning algorithm showing the highest accuracy is selected for further development. We integrate this model in a web application using python flask web development framework. The results of this study suggest that an appropriate preprocessing pipeline on clinical data and applying ML-based classification may predict diabetes accurately and efficiently

    A dependable hybrid machine learning model for network intrusion detection

    No full text
    Network intrusion detection systems (NIDSs) play an important role in computer network security. There are several detection mechanisms where anomaly-based automated detection outperforms others significantly. Amid the sophistication and growing number of attacks, dealing with large amounts of data is a recognized issue in the development of anomaly-based NIDS. However, do current models meet the needs of today's networks in terms of required accuracy and dependability? In this research, we propose a new hybrid model that combines machine learning and deep learning to increase detection rates while securing dependability. Our proposed method ensures efficient pre-processing by combining SMOTE for data balancing and XGBoost for feature selection. We compared our developed method to various machine learning and deep learning algorithms in order to find a more efficient algorithm to implement in the pipeline. Furthermore, we chose the most effective model for network intrusion based on a set of benchmarked performance analysis criteria. Our method produces excellent results when tested on two datasets, KDDCUP’99 and CIC-MalMem-2022, with an accuracy of 99.99% and 100% for KDDCUP’99 and CIC-MalMem-2022, respectively, and no overfitting or Type-1 and Type-2 issues.</p

    DeepCrop: Deep learning-based crop disease prediction with web application

    No full text
    Agriculture plays a significant role in every nation's economy by producing crops. Plant disease identification is one of the most important aspects of maintaining an agriculturally developed nation. The timely and efficient detection of plant diseases is essential for a healthy and productive agricultural sector and to prevent wasting money and other resources. Various diseases that could affect a plant cause crop farmers to lose a substantial sum yearly. Deep learning can play a crucial role in helping farmers prevent crop failure by early disease detection in plant leaves. In the experiment, we examined CNN, VGG-16, VGG-19 and ResNet-50 models on plant-village 10000 image dataset to detect crop infection and got the accuracy rate of 98.60%, 92.39%, 96.15%, and 98.98% for CNN, VGG-16, VGG-19 and ResNet-50 respectively. The study indicates that ResNet-50 outperforms the other models with an accuracy of 98.98%. So, the ResNet50 model was chosen to be developed into a smart web application for real-life crop disease prediction. The proposed web application aims to assist farmers in identifying diseases of plants by analyzing photos of the plant leaves. The proposed application uses the ResNet50 transfer learning model at its heart to distinguish healthy and infected leaves and classify the present disease type. The goal is to help farmers save resources and prevent economic loss by detecting plant diseases early and applying the appropriate treatment

    Smart Contract-Based Access Control Framework for Internet of Things Devices

    No full text
    The Internet of Things (IoT) has recently attracted much interest from researchers due to its diverse IoT applications. However, IoT systems encounter additional security and privacy threats. Developing an efficient IoT system is challenging because of its sophisticated network topology. Effective access control is required to ensure user privacy in the Internet of Things. Traditional access control methods are inappropriate for IoT systems because most conventional access control approaches are designed for centralized systems. This paper proposes a decentralized access control framework based on smart contracts with three parts: initialization, an access control protocol, and an inspection. Smart contracts are used in the proposed framework to store access control policies safely on the blockchain. The framework also penalizes users for attempting unauthorized access to the IoT resources. The smart contract was developed using Remix and deployed on the Ropsten Ethereum testnet. We analyze the performance of the smart contract-based access policies based on the gas consumption of blockchain transactions. Further, we analyze the system’s security, usability, scalability, and interoperability performance

    Cancer Classification Utilizing Voting Classifier with Ensemble Feature Selection Method and Transcriptomic Data

    No full text
    Biomarker-based cancer identification and classification tools are widely used in bioinformatics and machine learning fields. However, the high dimensionality of microarray gene expression data poses a challenge for identifying important genes in cancer diagnosis. Many feature selection algorithms optimize cancer diagnosis by selecting optimal features. This article proposes an ensemble rank-based feature selection method (EFSM) and an ensemble weighted average voting classifier (VT) to overcome this challenge. The EFSM uses a ranking method that aggregates features from individual selection methods to efficiently discover the most relevant and useful features. The VT combines support vector machine, k-nearest neighbor, and decision tree algorithms to create an ensemble model. The proposed method was tested on three benchmark datasets and compared to existing built-in ensemble models. The results show that our model achieved higher accuracy, with 100% for leukaemia, 94.74% for colon cancer, and 94.34% for the 11-tumor dataset. This study concludes by identifying a subset of the most important cancer-causing genes and demonstrating their significance compared to the original data. The proposed approach surpasses existing strategies in accuracy and stability, significantly impacting the development of ML-based gene analysis. It detects vital genes with higher precision and stability than other existing methods
    corecore